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computer programs for multivariate analysis of variance. As with 
earlier programs, analysis of variance, univariate and multivariate, 
is the main target of the program. Correlation analysis of all types 
is available with printout in the vernacular of correlation. In 
addition, VARAN includes several styles of factor and component 
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Introduction f 

This memorandum is the manual for the VARAN program. (The acronym 
comes from the words VARiance ANalysis) 

The VARAN program is the latest addition to a series of computer 
programs for multivariate analysis of variance which originated about 1957- 
About that time R. Darrell Bock had a MANOVA type program running on the 
UNIVAC at the University of North Carolina. Bock's original program 
handled only a few variables and was otherwise quite restricted and was 
not in much general use. Later, in 1962, C. E. Hall and Elliot Cramer 
with Bock's assistance published a program called MANOVA in FORTRAN 2 which 
was internationally circulated. The program handled 25 variables and 100 
degrees of freedom for hypothesis. 

The wide use of this program prompted further development by Cramer 
and by Bock and Finn. In 1966 Cramer published a program called MANOVA 
in FORTRAN k followed by Finn and Bock's program MULTIVARIANCE also in 
FORTRAN k. The development of these two programs greatly increased the 
scope of multivariate analyses which could be performed on computers. 

MULTIVARIANCE was the first of the programs in this series that 
utilized the full linear model. The earlier programs had been restricted 
to models of the cell means in their main streams of calculation. Other 
variations on the linear model, like canonical correlation, were of an 
"accidental" nature. With the development of MULTIVARIANCE, the series 
turned to the flexibility of the full linear model. 

The development of VARAN continues this series in the exploitation 
of the full linear model. Analysis of variance, univariate and multivariate, 
is the main target of the program as with the earlier programs. Corre- 
lation analysis of all types s available with printout in the vernacular 
of correlation. The hybrid of these, homogeneity of regression, has been 



added with as much flexibility as can be currently mustered. In addition 
to these, VARAN includes several styles of factor and component analysis 
complete with tests of factorization and rotation techniques. 

The addition of factor analysis and correlation techniques to a 
MANOVA program brings several new capacities to multivariate analysis in 
addition to the customary univariate and multivariate ANOVA, correlation 
and homogeneity of regression problems. Some of the new capacities for 
analyses are listed below. 

1. Three kinds of battery reduction procedures from multiple 
correlation applied to discriminant analysis, multivariate analysis of 
variance and canonical correlation. 

2. Seven kinds of rotation schemes from factor analysis applied to 
discriminant analysis, multivariate analysis of variance and canonical 
correlation. 

3. Factor extension from factor analysis applied to discriminant 
analysis, multivariate analysis of variance and canonical correlation. 

k. Estimation of population variance and covariance parameters 
from several samples as found in analysis of variance applied to corre- 
lation and factor analysis models. 

5. Use of "dummy parameters" from analysis of variance applied to 
factor analysis, analysis of variance (obtaining intraclass correlation 
coefficients) and correlation analysis (complex biserial correlation)* 

6. Homogeneity of regression techniques from analysis of covariance 
applied to correlation analysis from multiple samples and factor analysis 
from multiple examples. 

7. Interaction tables for ANOVA models. 

8. Variance reduction analysis for determining the effects of non- 
' orthogonality in ANOVA analysis. 



9. Dimension reduction analysis for determining overlap of effects 
in MANOVA and complex canonical correlation analyses. 

The VARAN program was written more in the style of a mathematical 
exercise in linear calculations than as a solution for specific statistical 
models. Therefore, the user can expect to find applications in analysis 
which were not specifically contemplated by the autnors. The VARAN program 
was also constructed to be easily expanded to include new linear uodel 
techniques as they are generated. Updates will be forthcoming periodically 
and for this reason it is suggested that all copies be obtained either 
from Educational Testing Service or from the authors. 

As is customary with programs of this size, the author makes no 
ironclad claims of arithmetic accuracy. Any errors discovered by the 
users vill be quickly corrected and distributed to other users. It should 
be noted that there were 85 or more problems run to check the accuracy 
of the main streams of calculation. It may also be noticed that the 
ability to make linear transformations of the data provides a wide variety 
of internal checks on calculation accuracy. 

All arithmetic is single precision except for orthogonal polynomial 
construction. The user is warned that lengthy manipulations of highly 
correlated variables are not advised. (This is not much of a deficiency 
in handling ANOVA designs since data are not generally useful when highly 
correlated and cell counts are generally close to orthogonality.) 

The authors wish to express their appreciation to the programming 
staff of Educational Testing Service for three years of assistance. 
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Control Cards in Brief 



TITLE Cards : ( optional) 
Cols 



1-4 
5-80 

PROBLEM Car d: 
Cols 
1-U 
5-6 
8 
10 

12 
Ik 
17-18 

20 

22 
2k 
26 



TITL 

Any alphameric description 

As many cards may be used as desired 

(required) 



PROB 

Number of variable format cards for data (10 or fever) 

Number of contrast card sets (at least 2, at most 7) 

Number of individual significance test control cards 
(at most 6) 

Print cell means and variances (l means yes)* 
Print reduced model matrix (l means yes) 
Data file number if not cards 

Print only estimates or raw regression coefficients 
(l means yes) 

Controls printed output (0 gives minimal output). 
See Table I 

Controls printed output (0 adds nothing to above). 
See Table I 

Type of covariance adjustment (0 or 1 does the 
classical adjustment; 2 uses "error" regression 
weights for reduction) 



Contrast Card Set: (At least two required) 
Cols 



Letter identification of the factor (W is not allowed); 
V is allowed only for continuous variables 



blank always means "no" 



-5- 



Cols 

2-3 Contrast code for design variables 

= 0 Regular contrasts (also called nominal or 
deviation) 

= 1 Special contrasts 

= 2 Orthogonal polynomials (integral values) 
(19 levels or less) 

= 3 Reverse Helmert contrasts 

= h Special design parameters 

= 5 Orthogonal polynomials (i.ormalized values) 
(39 levels or less) 

Function code for continuous variables (after V) 
= 0 do nothing 

= N obtain Nth power of all variables (N > l) 

1^6 Number of levels of a factor (at most ^0) or, after V, 

the number of continuous variables (at most 39) 

8 Read in variable names (l means yes) 

10 Linear transformations of all the variables to be 

read in (l means yes) 

2 col Number of variables in each partition (if no partitioning, 

fields do nothing) (at most NORD partitions) 

Next 2 col , , (two commas required) 

field 

Next 2 col Recoding of all the factor identification or, after V, 

field reordering of all the variables (if no recoding, or 

reordering, do nothing) 

Next 2 col (two commas required) 

field 

20. col Any description of the contrast or variables 

* field 

1 col . (a period required) 

h. Contrast Card Set Options : (optional) 

a. Variable names cards. FORMAT (10A8) 

Any names in 8 column fields, 10 names per card. The name cards 
must precede other options. 
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b. Special contrasts matrices or special design parameters. FORMAT 
(8F10.0) 

Contrast or design matrices are entered a row at a time, the row 
for the grand mean being first. Contrast and design matrices are 
square and of dimension equal to the number of levels of the factor. 

c. Orthogonal polynomial metric. FORMAT (8F10.0) 

The coefficients of the linear polynomial fit only. 

d. Linear transformation of all the continuous variables. FORMAT 
(8F10.0) 

Each regression coefficient is punched in a 10-column field, 8 
coefficients per card, as many cards as necessary for each regression 
equation. Each equation begins on a new card. The transf ormat ion 
matrix must be nonsingular square and of order equal to the total 
number of continuous variables. 



5- Significance Test Card (required) 

This card contains the model statement and indicates the tests to be 
calculated. The model statement is limited to at most 9 ce.rds. Some of 
the more common models are these. 

a. Two way factorial analysis of variance 
MOVA:V^,A=V,fcV,AB=V. ^ K 

b. Correlation analysis 
CORREL:tt=0,Vl=V2. 

c. One way analysis of covariance 
ANCVA:Vf=0 ; A^Vl/V2. 

d. Minres factor analysis with varimax rotation 
FACTOR8#2:tt=0,0=V. 

e. Interaction Tables in a three way factorial 
I NTABL : WAE=V , WAO=V , WBO= V . 

The following symbols are used: 

ANOVA, CORREL, FACTOR, I NTABL and VARED are acronyms to begin the 
card and state the kind of model 

: to separate acronym from model statement 

Letters are used to denote ANOVA-type "main effects 11 
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Letter followed by a numeral for partitions (except after W) 

A sequence of letters or letters and numerals without symbols between 
for interaction effects 

+ to indicate pooling of effects 

= to separate hypothesis variables from error variables. 

In this description any variable or variables 
to the left of an equal sign is called a "hypothesis 11 variable; 
any variable or set of variables to the right of the equal 
sign is called an "error 11 variable except if it is also to the 
right of a slash (/) or an ampersand (&). 

- (minus) for dimension reduction 

, (comma) to separate tests 

/ to indicate that covariates follow 

& to indicate that extension variables follow 

* in an error term between two sets of variables to 
indicate that the regression of the left set on the 
right set is to form an error tenn for an analysis 

0 (a zero) as an error term indicates that the hypothesis 
term is to be included in the model but not tested 

0 (a zero) as a hypothesis term indicates that the 
error term is to be used in a factor analysis 

V as a hypothesis term indicates the constant term 
or the grand mean 

W in a letter-number sequence indicates that the effect 
on the left is nested in the effect or cells on the 
right 

a digit is required after FACTOR to indicate the type 
of factor analysis, see Table II 

# after FACTOR and its digit indicates that rotation of 
the factors is to be done: a digit must follow this to 
indicate type of rotation, see Table III 

. (a period) must end the statement 



ANOVA denotes an analysis of variance model, multivariate or univariate; 
CORREL denotes a correlation model, product moment, multiple or canonical; 
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FACTOR denotes a factor analysis model, including principal components; 
INTABL denotes interaction tables for analysis of variance; VARED denotes 
a variance reduction study of correlation and analysis of variance problems. 

6. Variable Format Cards (required) 

These cards describe the data records p.nd are the usual variable format 
cards of FORTRAN. Ten cards are allowed. (There is no fixed format data 
option in this program.) It is required that cell identification be 
read in before the scores on the continuous variables- 

7. Data (required) 

The data may be on cards or a computer kept file (see columns 17-18 of 
the PROB card). If the data are on cards, put enough blank cards after the 
data to make a blank, but complete, data record. If the data are on a tape 
or disk file, the end-of-file mark will signify *2ie end of the data. 

8. Individual Significance Test Control Car ds (optional) 

These cards control the use of expository calculations on designated 
tests of the model. At most 6 of the significance tests of the model can 
be subject to these techniques. 

Cols Information 

1-2 The number of the significance test to which this infor- 

mation applies (obtained by counting equal signs from 
the right [or the period]) 

3-k Probability statement times 100 for battery reduction, 

dimension reduction and rotation of canonical variates 
in ANOVA and CORREL 
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Cois Inf onnat ion 

6 Type of battery reduction procedure -a % . variables 

1 for Efroymson's stepwise addition/elimination 
procedure 

2 for Wherry-Doolittle stepwise addition procedure 

3 for Step-up variable elimination procedure 

7-8 Number of factors to be obtained in a factor analysis. 

For components analysis, the number of components is 
always the number of variables and for principal factor 
analysis this is always one less than the numl/.v of 
variables; no entries are required for these solutions. 
All other solutions require an entry here, 

10 Factor analysis tests 

Blank for none 

1 for Rao's test in canonical factor analysis 

2 for Rippe's test 

12 Type of rotation technique (see Table III) 

1*1 1 for direct rotation of canonical variates; all factor 

analys is 

2 for indirect rotation of canonical variates 

16 1 for taxonomy of variables 

2 for taxonomy of groups 

17-18 Power of 0.1 (multiplied by 10) for the convergence 

criterion in uniqueness iterations of canonical factor 
analysis (see Table IV). Entering l6 produces a 
criterion of = .0251189 



20 



rid 



1 if squared communalities are supplied for factor analysis 

1 if weights are supplied for rotations of factors or 
canonical variates 



Individual Significance Test Card Options, (optional) 

a. Communalities for factor analysis. FORMAT (8F10.0) 

Squared communalities are entered in 10 column fields, 8 per card, 
for as many cards as necessary. 

b. Weights for rotation of factors and canonical variates. FORMAT 
(8F10.0) 
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Weights are entered in 10 column fields. For each canonical variate 
or factor there must be as many weights as there are variables. The 
weights for each canonical variate must begin on a new card. When 
rotating canonical variates in ANOVA or CORREL, there must be as 
many sets of weights as there are canonical variates; if the rotation 
is a taxonomy of groups , each set of weights will have only as many 
weights as there are groups. For principal components, weights 
must be supplied for all components (as many as there are variables). 
For all forms of factor analysis weights must be supplied for as 
many factors as are indicated in columns 7 and 8 of the individual 
significance test card or, if this is zero, as many as the number 
of variables. 

10. TITLE Cards (optional) 

TITLE cards may be used here to identify reanalyses. 

11. Reanalysis Card (optional) 
Cols 

1-k ANLY (not MALY) 

6 1 if a new significance test card is used 

8 number of new contrast card sets 

(must be used for changing the partitioning of the 
continuous variables) 

The following features are the same as on the PROB card and appear in 
the same card columns , 



10 Number of individual significance test cards for this 

analysis 

12 Print cell means and variances (l means yes) 

Ik Print reduced model matrix (l means yes) 

Data file number not necessary 
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Cols 

20 Print only estimates or raw regression coefficients 

(l means yes) 

22 Controls printed output (0 gives minimal output). 

(See Table I) 

2k Controls printed output (0 adds nothing to the above). 

(See Table I) 

26 T*ype of covariance adjustment (0 or 1 does the classical 

adjustment, 2 uses "error" weights for reduction) 

All other control cards and their options remain the same as for the 
original analysis. 

12. Several Problems 

Many problems may be submitted with a single run. 

13. FINISH card 
Cols 

1-6 FINISH 



7 
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Table 1 

Printed Output Controls 

Information which is printed when column 22 and column 
2k of the PROB or ANLY card are coded. 





Codes 


Cross- index 


Col 


22 


Col 2k 


for 

Information 


Table 
IX 


Correlation 




2* 


** 


Correlation among the hypothesis variables 


a 


b,2 






Standard deviation of hypothesis variables 




b,l, 


2 




Cross-correlations between hypothesis and error 
variables 


b 


2 




3 


Raw score weights for regressing hypothesis 
variables onto error variables 


c 


2 




3 


Standard score weights for regressjng hypothesis 
variables onto error variables 


d 


2 




1.3 


Standard error about the regression line(s) 




b,l, 


2 




Correlations among the error variables 


e 


b,2 






Standard deviations of the error variables 




1,2 






Regression sums of squares when total sums of 
squares of error variables are unities^ 


f 








Statistical Summary 










Dimension Reduction Statistics (Canonical Correlation 
only) 



^Punching a blank (b), a 1 (one) or a 2 (two) forces printing of the information. 
**This designates that col 2k does not control printing of this information. 
Numbered footnotes can be found at the end of the table. 
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Codes 

Col 22 Col 2h Information 

b(2 ... „»™ F statistics tor m «W 

b 2 Correlations between the variables and the 

canonical variates within the error set 

I 2 Standard score weights for regressing variables 

onto canonical variates within the error set 

2 l Raw score weights for regressing variables onto 

canonical variates within the error set 

2 2 Weights for regressing raw score variables onto 

error canonical variates with unit sums of 
squares 

12 Regression sums of squares when total sums of ^ 

squares of hypothesis variables are unities 

b 2 Univariate F statistics for hypothesis variables 

b 2 Correlations between the variables and the 

canonical variates within the hypothesis set 

b 2 Standard score weights for regressing variables 

onto canonical variates within the hypothesis 
set 

2 l Raw score weights for regressing variables onto 

canonical variates within the hypothesis set 

2 2 Weights for regressing raw score hypothesis 

variables onto hypothesis canonical variates 
with unit sums of squares 

3 12 3 Print only the information indicated in column 

Analysis of Variance 

b,2 Haw estimates of effects m 

3 

12 Standardized estimates of effects o 

ii 

2 1 Orthogonalized estimates of effects n 

b ? 2 Univariate standard errors 
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Codes 



Col 22 Col 2k 
1,2 

1,2 



b,2 
b,2 

b,2 



Inf ormatj cn 

5 

Error dispersions reduced to correlations 

Hypothesis sums of squares when error sums 
of squares are unities 

Hypothesis suns of squares and error sums 
of squares (univariate) 

Statistical summary 

Dimension reduction statistics 

(discriminant analysis and MANOVA only) 

Univariate F statistics 

Correlations between the variables and the 
canonical variates 

Correlations between the variables and the 
canonical variates weighted by the square r 
roots of the associated canonical variance 1 

Discriminant function coefficients for 
standard scores 



p 
f 



6 
h 



q 

r 



b,l,2 

2 



Discriminant function coefficients for raw 
scores 

t 

Estimates of effects for the canonical variates' 

Transform? tion matrix for obtaining canonical 
contrasts ( unreduced )9 



Factor Analysis 



Correlation matrix or covariance matrix (as 
appropriate) 

Estimates of standard deviations"*"^ 

Communality estimates, when applicable 



Final communality estimates (canonical factor 
analysis only) 



Characteristic roots and vectors 
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Codes 

Col 22 Col 2k Information 

Factor coefficients (unrotated) 

Images and anti images (image analysis only) 

Canonical correlations (Canonical factor analysis 

only) 

1 Regression weights for factor scores (total 

variance procedures only) 

Statistical tests (if used) 

1 Residuals from the correlation matrix 

1 Mean and standard deviation of residuals (below 

the diagonal) 

1 Frequency distribution of residuals (if program 

is set up to handle 20 or more continuous 
variables ) 

Variance Reduction 



1 Sum of squares among design parameters and 

percentage loss 

1 Correlation among design parameters before 

reduction 

1 Correlation among design parameters after reduction 

1 Sum of squares among contrasts and percentage 

loss 

1 Correlations among contrasts before reduction 

1 Correlations among contrasts after reduction 

1 Correlations among hypothesis sums of squares 

before reduction 

1 Correlations among hypothesis sums of squares 

after reduction 

Data on trace of hypothesis sum-of- squares matrix 

Hypothesis variance data for individual variables 

and percentage loss 
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Footnotes for Table I 



1. Regression sums of squares when the total sums of squares of the error 
(hypothesis) variables are unities. This matrix has on its diagonal 
the squared multiple correlations between the error (hypothesjs) 
variables and all hypothesis (error) variables. The off-diagonal 
elements become the correlations among the regressed variables when 
they are divided by the square roots of the diagonal elements* 

2. Univariate F statistics for error (hypothesis) variables. These are 
the F tests of the multiple correlations for each error (hypothesis} 
variable regressed on all the hypothesis (error) variables. 

3. Standardized estimates are the raw estimates divided by their standard 
errors. A standardized estimate of +1.0 is one standard error above 
the grand mean of the data. These figures are easy to relate to 
confidence intervals about the grand mean. 

*4. Orthogonalized estimates are appropriate in nonorthogonal designs. 

They are what is left of the raw estimates after the nonorthogonality 
of the design has been accounted for. The analysis which is produced 
is an analysis of these estimates. Comparison of the raw estimates 
with the orthogonalized estimates is sometimes useful in determining 
the effects of nonorthogonality on the analysis. In E. Cramer 1 s 
MAHOVA these are the "Estimates." 

5. ' Error dispersions reduced to correlations. In a multivariate analysis 

of variance the error term is a variance-covariance matrix. This is 
that error term reduced to a correlation matrix. 

6. Hypothesis sums of squares when error sums of squares are unities. 

The diagonals of this matrix when multiplied and divided by the degrees 
of freedom give the univariate F-ratios. The off-diagonal entries are 
a type of "covariance F" and reflect the relationships among treatment 
effects on the variables. 

7. Correlations between the variables and the canonical variates weigh ced 
by the square roots of the associated canonical variance. These values 
are related to "Student's" t: when the analysis is of two samples and 
a single variable, it is "Student's" t. In a MANOVA or discriminant 
analysis, summing the squares of these values for one variable across 
the canonical variates will produce the univariate F ratio for that 
variable. 

8. Estimate of effects for the canonical variates. These are the mean 
discriminant scores when the grand mean is zero. These estimates always 
add to zero for each canonical variate (or discriminant variable). 
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9. Transformation matrix for obtaining canonical contrasts. This matrix, 
when used to multiply the contrasts, produces the canonical contrasts. 
In orthogonal designs it produces exactly the canonical contrasts; in 
nonorthogonal designs it produces the canonical contrasts ignoring 
adjustments for the lack of orthogonality. 

10. Estimates of standard deviations. This program always assumes that 
the data are a sample and not a population. These are not standard 
deviations but sample estimates. The estimates are residual to any 
covariates or sampling design. 
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Table II 
Factor Analysis Codes 



Codes 


Type of Factor Analysis 


1 


Principal components of dispersion 


2 


Principal components of correlation 


3 


Principal factor analysis 


k 


Image analysis 


5 


Canonical factor analysis 


6 


Maximum likelihood factor analysis 


7 


Alpha factor analysis 


8 


Minres factor analysis 



Table III 

Factor Canonical Variate Rotation Codes 



Codes 


Type of Rotation 


1 


Quart imax 


2 


Varimax 


3 


Equamax 


I* 


Promax 


5 


Multiple groups 


6 


Orthogonal centroids 


7 


Orthogonal bounds 
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Table IV 



Power of .01 for Convergence Criterion 



Power x 10 



Criterion 



11 
12 
13 
111 
15 
16 
IT 
18 
19 
20 



.079^329 
.0630957 
.0501188 
.0398107 
.0316227 
.0251189 
.0199526 
.01581*90 
.0125893 
.0100000 



Note : It so happens that the criterion is modular in the 
second digit of the power of 0.1 while the first digit 
determines the numter of zeros after the decimal; i.e., 
use of 2k gives a criterion of .00398107 and kk gives 
a criterion of .0000398107. 
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Control Cards in Detail 

TITLE Card 
Cols 

l- 1 * TITL these letters exactly 

5-80 Any description that can be keypunched 

As mavj TITLE cards may be used as the user 
sees fit. All cards must have the letters 
rlTL in columns 1 to k 



PROBLEM Card 
Cols Codes 



1-U 
5-6 



NFMC 



NCONT 



Explanation 

PROB these letters exactly 

Number of variable format cards used 
(10 or fewer) 

The variable format cards are used to describe 
the scores as they appear on the data 
file. This information tells how many 
variable format cards are necessary to 
describe the format of the scores. 

Number of contrast card sets 
(at least 2; at most 7) 

Each contrast card set describes a factor 
of an analysis of variance design except 
that one additional set is used to describe 
the continuous variables of the linear model. 
Thus a one-way analysis of variance model 
must have two contrast card sets: one for 
the design and one for the continuous vari- 
ables. A simple correlation problem must 
have a dummy contrast card set even if it is 
only one sample, and one set for the con- 
tinuous variables. 



In many problems tne contrast card sets will 
have only one card per set. 
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Cols Codes 
10 INF028 



12 
Ik 

17-18 



MPRINT 



KPRINT 



INFILE 



20 



INF021 



22 



INF09 



2k 
26 



INFO10 
INFO 1 6 



Explanation 

Number of individual significance test 
control cards (at most 6) 

For any tests calculated it is possible 
to enter an individual significance test 
control card to control use of any of 
several expository techniques such as 
rotation, battery reduction or the like. 
At most 6 tests may use these tech- 
niques, so that at most 6 individual 
significance test control cards may be 
used. 

Punching a 1 (one) forces printing of 
the cell means and variance; leaving 
a blank avoids calculation and printing 

Punching a 1 (one) forces printing of the 
reduced model matrix 

If the data are on cards and submitted 
with the set-up cards, leave blank. 
If the data are on a computer kept file, 
punch in the file number 

Punch a 1 (one) when only estimates of 
effects or regression coefficients are 
desired for all the tests. Otherwise 
leave blank 

This column controls what output is to 
be printed. Leaving it blank prints 
the bare essentials of the test. 
Punching a 1 (one) or 2 (two) prints 
additional output. See Table I 

This rolumn also controls what output is 
to be printed. See Table I 

This column controls regression in analysis 
of^ovariance and must be used for all 
covariate adjustments of data. Punching 
a 1 (one) produces the classical analysis 
of covariance adjustment. Punching a 2 
(two) produces a reduction of the hypothesis 
sum of squares by the error regression 
weights. Failure to enter a code results 
in the classical adjustments. 
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3. Contrast Card Sets 

The description belcv shows that the required information may demand 
more than one card, hence the term "card set" rather than "card." 

One set is required for the continuous variables and one set for each 
factor of an ANOVA design. If the data comprise a single sample as often 
happens in factor analysis or correlation analysis, it is still necessary 
to include a contrast card set for a "factor of a design"; that is, treat 
the datum as if it were a one-way design with a single level. For this it 
may convenient to have a 1 at some position in each data record; or 
any constant digit can be used and recoded, even a blank. The order of 
the card sets must be the same as the order in which the data factor codes 
appear on the variable format statement. 

It is not necessary that all "factors" of a design appear on the 
significance test can . This makes it possible to use a "factor" index 
code for file editing or in later reanalyses of the data. Use of a factor 
code for editing should be limited since the culled cases are rejected 
and listed on the output. 

Cols Name < Explanation 



NTABLE 



2-3 



ICONT 



For a factor of an ANOVA design: Any 
letter except V or W, but no two 
factors may use the same letter 

For the continuous variables: 
The letter V only 

Contrast codes for design variables 
= b Regular contrasts: Also called 
nominal and deviation contrasts 
Example: for three levels of a factor 
mean 1/3 1/3 1/3 
df, 2/3 -1/3 -1/3 

-1/3 2/3 -1/3 



df 2 



= 1 Special contrasts. Any set of contrasts 
the user wants. The matrix must be fed in 
as described below and must follow this card 
set immediately 



-23- 



iA 


1A 


lA 


iA 


-l 


1 


0 


0 


-1/2 


-1/2 


1 


0 


-1/3 


-1/3 


-1/3 


1 



Explanation 

= 2 Orthogonal polynomials (integral values 
only). This option generates orthogonal 
polynomial contrasts and design parameters as 
integral values. The number of levels is 
limited to 19 for numerical reasons. The 
metric of the polynomial (i.e., the linear 
coefficients) must be supplied and must follow 
this card set immediately. 

= 3 Reverse Helmert contrasts: also 
'called difference contrasts. These 
contrasts are as follows: 

mean 

df i 

dfi 

df 3 

It should be noted that the fractions 
involved in these contrasts are not 
exactly representable on computers. 
This may cause serious numerical problems 
due to rounding errors when the number of 
data cases is large, say a couple oi 
hundred . 

0 

= h Special design parameters. Any set of 
design parameters the user wants. The 
matrix must be fed in as described below 
and must follow this card set immediately. 

= 5 Orthogonal polynomials (decimal values 
only). This option generates orthogonal- 
polynomials with normalized coefficients. 
Coefficients can be generated up to order 39, 
and the polynomials of order 39 are accurate 
at least up to degree 5. The accuracy of the 
higher order polynomials is not guaranteed. 
The metric (i.e., the linear coefficients) 
must be supplied and must follow this card 
set immediately, 

For power functions of continuous variables 
= 0 no powers generated 

= N (after V). When the contrast card 
set describes the continuous variables, this 
number describes the number of powers of 
each variable that will be generated to 
^orm a new variable; that is, if n = 3, 
the variables V, V 2 and V 3 will be available 
for analysis. It will also be assumed that 
there are N times the number of variables 
(i.e., 3 x V) in the new analysis and that 
they are in the order V, V 2 , V 3 . 
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Cols 



Name 



Explanation 



U-6 



NLEV 



Number of levels for the factor or, after V, 
the number of variables (not including 
powers ) 



KRDNA 



= b if the dummy or, after V, continuous 
variables are to be labeled by number 



= 1 if the dummy or, after V, continuous 
variables are to be labeled by names 
which will be supplied immediately after 
this card set 

10 KLINT When the variables are to be subjected to 

linear transformations before analyses, 
punch a 1 in column 10 and follow the 
contrast card set with the regression 
coefficients. Coefficients are expected 
in the same order as the variables are 
entered 



11,12 

and 2 

column 

fields 



LEVSUB 



the next 
2 column 
field 



in 2 

column 

fields 



RECODE 



When the dummy or the continuous variables 
are to be subdivided .into partitions, 
this is number of variables in each 
partition. The number of degrees of 
freedom or variables are punched in 2 
column fields and must account for 
all degrees of freedom or the total 
number of variables. If no partitioning 
is done, ignore 

(two commas). This is necessary whether 
or not the variables are partitioned. If 
the variables are not partitioned, the 
commas go into columns 11 and 12 

Recoding of factor level identification or 
reordering of the variables 

For recoding the level identification, 
enter the code as it appears in the data 
file in the order in which the codes are 
to be renumbered. For example, if the 
codes are 15, 5 and 31 and these are 
to be recoded as 3, 1 and 2 make the 
entries in columns 11 through l6 as b 53115. 
Use two column fields throughout and account 
for all levels. Zero is an admissible level 
code. 
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Cols 



Name 



Explanation 

For reordering continuous variables, 
enter the serial numbers of the variables, 
as they appear on the data file, in the 
new order. Use two column fields through- 
out and account for the total number of 
variables . When several operations are 
done to the variables "*n sequence, the 
sequence of operations is 
1 # raising to powers 

2. making linear transformations 

3. reordering 
k. partitioning 
5. naming 



In employing these features, the user must keep 
in mind such problems as (l) linear trans- 
formations must include all powers of 
the variables, (2) partitioning takes 
place on transformed and reordered variables 
etc . 



in the 
next 2 
column 
field 



When variables are raised to powers, columns 
k to 6 must contain the number of original 
variables, but all subsequent operations 
must take account of the original variables 
and their powers. That is, if there are 5 
original variables and these are raised to 
the third power, the program will expect 
15 linear transformations, 15 variables to 
reorder, 15 variables to partition and 
15 names 

(two commas). This is necessary whether or 
not the variables are partitioned and 
whether or not reordering or recoding has 
beer. used. If neither feature is used, 
the commas go into columns 13 and lh 



the next 
20 or 
less 
columns 



ANAME 



Any alphameric description of the factor or 
variable set* This may be as long as 20 
characters 
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Cols Name Explanation 

Last . (a period) 

Column 



Contrast Card Set Options 

These descriptions govern use of options found on the contrast cards. 
Of the k options, use of one precludes use of the others except for the 
variable names option. When variable names are supplied, the names cards 
must precede cards used for other options. All options requiring cards 
must follow the contrast card set to which they apply and precede any 
successive contrast card set. 

a. Variable names cards 

The use of this option enables printing of names on the output. 
Both continuous variables and design parameters (or contrasts) 
may be named, although dummy parameters for ANOVA interactions 
will always be numbered. 

Names will have 8 characters (including blanks) and 10 names 
may be put on a card. When dummy parameters are named there 
must be as many names as there are degrees of freedom. When 
continuous variables are named there must be as many names 
as there are variables. When the continuous variables are 
reordered^ the names must be supplied in the new order as the 
names are not reordered. When linear transformations are made, 
the names will be affixed to the transformed variables 
(including analysis of covariance). When powers of the variables 
are generated, ti:e program expects as many names as there are 
variables times povers. 

b. Special contrast or special design cards 

This option enables use of special one-way contrasts and design 
parameters. They are entered as square matrices with as many 
rows and columns as there are levels of the factor. The first 
contrast or design parameter entered is always that for the 
constant term or grand mean and is the first row of the matrix. 
The weights for obtaining the constant term or grand mean need 
not be all equal nor do the contrast coefficients or design 
parameters need to sum to zero, although this is usually 
desirable. 
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The elements of the matrix are punched in 10 column fields, 
8 elements per card. Each row must begin on a new card and 
may continue on as many cards as necessary. 

c. Orthogonal polynomial metric 

The coefficients of the linear polynomial are called the metric 
and are usually (depending on the experiment) the integers 
from 1 to N indicating equal spacing of the levels of treatment. 
Equal spacing is not necessary, however, and the metric need 
not be successive integers. 

The coefficients of the linear polynomial are entered in 10 
column fields, 8 per card and continued on as many cards as 
necessary. 

d. Linear transformation of the continuous variables 

This option allows for testing linear combinations of the 
continuous variables. To use the option it is necessary to use 
all the variables in the regression equations and to have as many 
regression equations as there are variables even though some of 
the coefficients might be zero or one. The regression equations 
are entered as a transformation matrix, with the equations as 
rows. Each row begins on a new card, the coefficients are punched 
in 10 column fields, 8 per card, and on as many cards as necessary. 

When reordering is used in conjunction with linear transformations, 
the reordering takes place on the transformed scores. 

5. Significance Test Card 

The significance test card describes the model under which the data 
are to be analyzed and tested. Linear models in three styles of calculation 
are available: analysis of variance, correlation, and factor analysis. 
The calculations to be made are indicated by the acronyms AHOVA, CORBEL, 
or FACTOR in the first few columns of the significance test card. 
Calculating interaction tables in a factorial sampling design can be 
indicated by the acronym IHTABL. Variance reduction studies can be 
calculated by using "the acronym VAR^D. 

Use of the acronym AHOVA invokes solution of the linear model as an 



analysis of variance problem. Either one variable or many may be analyzed 
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in the model. The null hypothesis test utilizes Wilks 1 lambda 
criterion as approximated by the F distribution. Although the lambda 
distribution is ostensibly multivariate, its degenerate cases, uni- 
variate ANOVA, Hotelling's T 2 , Mahalanobis 1 distance D, discriminant 
analysis and "Student's" t are all handled automatically. 

Use of the acronym CORREL invokes solution of the model as a corre- 
lation problem. The null hypothesis test is Wilks 1 lambda criterion as 
is the case for AHOVA problems. Again, the degenerate cases of canonical 
correlation: multiple correlation, product moment correlation, biserial 
correlation and point biserial correlation are all handled automatically. 

Use of the acronym FACTOR invokes a factor-analytic decomposition of 
the data. Several types of solutions are available as displayed in 
Table II. The acronym FACTOR must be immediately followed by a digit 
from Table II to denote which factor decomposition is to be used. It 
is possible to denote a rotation procedure by following this digit with 
a number sign (#) and another digit to denote type of rotation. When 
comniunality procedures other than squared multiple correlations are used, 
the individual significance test card also must be used. When less 
than all the factors are to be extracted, the individual significance 
test card must be used. 

Use of the acronym INTABL generates interaction tables for ANOVA 
problems. Interaction tables are generated as a nesting procedure with- 
out including the grand mean in the model. Thus the statement ,WAB=V1, 
generates all the means of the variables in V for the AB interaction 
of an AHOVA model. It is possible to generate many interaction tables 
in the same model statement. The estimates used for constructing 
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interaction tables are the raw estimates and are not subject to analysis 
of covariance adjustments. Interaction tables adjusted for covariates 
may be obtained by using regression transformations on the original 
variables. 

Use of the acronym VARED generates a study of the sum of squares 
for hypothesis before and after reduction of the model to orthogonality. 
Model statements follow the rules for ANOVA and CORREL models. VARED 
studies are not subject to analysis of covariance adjustments except 
by using regression transformations on the original variables. 

On the remainder of the card, special symbols, letters and numbers 
are used to designate the model to be analyzed. 

Letters . The letters used are those from the contrast card set. 
That is, use of the letter A assumes that there is an effect and a factor 
of the design to be called A and that there is a contrast card set that 
has ft A tf punched in column 1. 

There is also a contrast card set for the continuous variables which 
has a "V" in column 1. When the phrase ,A=V, is punched into the card, 
a sum of squares for the A effect will be generated and tested for its 
regression on the variables V. 

If the design is a factorial, there may also be a B effect to test. 
For this we punch the phrase ,B=V, in the significance test card. To 
test the interaction effect v/e punch the phrase ,A3=V, in the card. 
The use of two or more letters adjacent generates the Kronecker product 
of the pain effect design parameters necessary fci testing interactions. 

Order of Kronecker product . This program generates Kronecker product 
in the order determined by the order of the contrast card sets and 
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ultimately by the input data record. Consider the term AB. If the 
contrast cards set for factor A precedes that for factor B (which is to 
say the identification code for factor A is to the left of the identi- 
fication code for factor B on the data file) the AB dummy parameters are 
generated and the estimates printed in the order & 2 b l 9 ^-^V 9 ' 

a n^l 9 a l b 2' a 2 b 2* ' ' ' ' a 'n ) 2' 3 'l} ) y ' ' ' ,a n^rn * ^ e same or< * er is generated 
whether the interaction term is written BA or AB. 

Equal sign . An equal sign ( = ) is used to separate the hypothesis 
variables from the error variables. The hypothesis variables are 
designated by letters and numbers to the left of the equal sign; the 
error variables are designated by letters and numbers to the right of 
the equal sign. 

Commas, colons and periods . Commas ( , ) are used to separate the 
tests from each other. A colon ( : ) is used to separate the model 
acronym from the tests; and a period (.) is always the last character 
in a model statement. 

Grand mean and zero . The constant term or grand mean may be included 
in the model by using the phrase ,W=0, where W indicates the constant 
term as a hypothesis and 0 (a zero) indicates that there are no error 
variables for the hypothesis and no test to be made. The use of zero 
as an error term is the way in which a set of variables may be included 
in the model as if they were hypothesis variables but not tested. 
When 0 is used as a hypothesis, it indicates that the error term is 
to be subjected to factor analysis. 

With this information it is possible to write a significance 
test card for a simple factorial analysis of variance: 
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( a ) ANOVA : W=0 , A= V , B= V , AB=V . 

or a significance test card for a factor analysis: 

(b) FACTORl:Ws0,0=v. 

Numbered partitions . The contrast card option for partitions makes 
it possible to subdivide the sets of design parameters and continuous 
variables into several subsets. The partitions are written on the 
significance test card as Al, A2, or VI, V3, BU, etc,, the number 
referring naturally to the ordinal position of the partition (this use 
of numbers can be easily confused with the use of numbers in nested 
models ) . 

With this information it is possible to write the following models 
and many others. 

(c) Partitioned analysis of variance (such as orthogonal 
polynomial tests) 

MOVA : W=0 ,A1=V , A2= V , B=V , A1B=V , A2B=V . 

(d) Correlation models 
CORREL:W=0,V1=V2. 

(e) Homogeneity of regression in a one-way design 
MOVA :W=0 , A=0 , V1=0 , AV1=V2 . 

(f) A principal factor analysis on several samples 
FACTOR 'j , W=0 , A=0 , 0=V . 

(g) Correlation analysis from several samples 
CORREL: W=0 ,A=0 ,V1=V? . 

Plus sign . The plus sign is used to pool several effects to produce 
a sum of squares. In an analysis of variance with several factors, one 
may pool high order interactions and test them simultaneously for the 
pooled variables V1+V3. 
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For instance, 

( h ) , AB+AOBC+ABC=Vl+V3 . 

When orthogonal polynomials are partitioned, this also allows re- 
pooling of high order polynomials for simultaneous teSving, as follows, 

(i) ,Pl=V,P2=V,P3+P*i+P5=V. 

When + is used between two sets of variables, the variables are 
pooled before any other operation takes place: For example V1/V2+V3 
indicates that both V2 and V3 are jointly covariates for VI. 

When the parameters for a given effect are used both singly 
in some tests and pooled in other tests, the order in which the 
parameters are presented on the significance test card must always 
be the same for every test in which the parameters are used. 

V/ and numbers for nested analyses . The letter W is commonly used 
in statistical literature to indicate nesting; BWA indicates that 
several samples, B, are nested in each of the levels or samples of A. 

4 

This notation is expanded slightly here as follows. 

(j) ,BWA1=V, indicates a test of B within the first 
level of A. 

(k) ,BWA1+BWA2=V, indicates a pooled test of B within 
only the first two levels of A. 

(l) ,BWA=V, indicates a test of B pooled for all levels 
of A. 

(m) ,BWAC, indicates a test of B pooled over all cells 
of a two-factor design, AC. 

(n) ,V1WA1=V2, indicates a test of the correlation between 
VI and V2 within the first level of factor A. 

Here, a number, used after a letter which follows a V/, indicates 

the level, not a partition. A number useji after a letter but before 

a W (or in the absence of e W) indicates a partition. 
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Slash (/) for analysis of covariance . The use of a slash indicates 

that all variates after the slash and before the next comma (or colon 

or period or minus) are to be covariates for the test ii.I-cated. A 

slash and variables immediately following the acronym indicates that 

all the tests in the model have those variables as covariates. This 

makes possible the following types of models. 

(o) Analysis of covariance in a factorial analysis of 
variance 

MOVA:W=0,A=V1/V2,B=V1/V2,AB-V1/V2. or 
MOVA /V2 : W=$f ,A=V1 ,B=V1 , AB=V1 . 
(p) Partial correlation 
CORREL:W=tf,Vl=V2/V3. 
(a) Principal components analysis with covariates 

FACTORl/V2:W=0,0=Vl. 
(r) A1. T OVA:W=0,A=V1,B=V1/V2,AB=V1/V2+V3. 
Ampersand (&) for extension . The use of an ampersand indicates 
that all the variables after the ampersand and before the next comma 
for colon or period or minus) are to be used as extension variables 
to the canonical variates of the test indicated. An ampersand immediately 
following the acronym indicates that extension is to be done to all 
the canonical variates in all the tests in the model statement. This 
makes it possible to write the following models. 

(s) Discriminant analysis of VI and A extended to V2 
AIIOV A : W={f > A =V1&V2 . o r 
ANOVA&V2:W=0,A=V1. 
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(t) Canonical correlation between B and VI extended to 
V2 and V3. 

CORREL:W=0,B=V1&V2+V3. 
(u) Factor analysis of VI extended to V2. 
FACTORS : W=0,0=V1&V2. 
Number sign (£) for rotation . The use of # after the acronym 
FACTOR and its digit indicates that factors are to be rotated. A 
digit must follow to denote which rotation scheme is to be used. 
Table III lists the types of rotations available. This makes it possible • 
to write the model. 

(v) Alpha factor analysis with equamax rotation , 
FACTOR7#3:W=0,0=V. 

(Rotation of canonical variates in AIJOVA or COFREL 
can be done only by using the individual significance 
test card. ) 

Asterisk (*) for components of variance . In many components of 
variance models, the error term for a test can be generated as a 
regression sum of squares. More simply, the sum of squares for hypothesis 
of one test may be the sum of squares for errors of another test. 
Therefore, to make it easy to calculate the sum of squares for error 
we use the notation A*V to denote the regression of continuous variables 

V on dummy parameters A to form an error term. This notation is allowed 
only on the right of an equal sign where errors are designated. 

It should be noted that this program will not solve multivariate 
components of variance models where the number of continuous variables 

V is more than the number of dummy parameters A or degrees of freedom 
for error. 
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Special notes , 

(a) When effects are partitioned and repooled (for example V1+V3) 
the pooling must always be presented on the significance test card 

in the same order. That is, you cannot state A=V1+V3 and B=V3+V1 
in the same model. You must state A=V1+V3 and B=V1+V3. Also, when 
effects are partitioned, they must stay partitioned: i.e., A=V is not 
an alternative for A1+A2=V when A has been partitioned. 

(b) In one model statement, a set of parameters may be used onl v v 
for a hypothesis, an error, an extension or a covariate but not for 
two of these. 

6. Variable Fbrmat Cards 

These cards describe the way in which the data on the observations 
appear on the data file. There may be as many as 10 -cards to describe 
the format. The use of the variable format follows the customary 
FORTRAN restrictions. 

Suppose the data appeared as follows: 



Cols. 1-5 information to be ignored 

6 level number of the first factor 

7 information to be ignored 

8-9 level identification of the second factor 

10-13 a r * urn on the first continuous variable 

lU_l8 a datum on the second continuous variable 

19-21 information to be ignored 

22-25 a datum on the third continuous variable 

26-to end information to be ignored 
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The information on these records can be read with the following 
variable format statement 

(5X,Il,lX,l2,Ft.O,F5.0,3X,Ft.O) 
It will be noticed that all identification numbers are expected in 
fixed (I, integer) format while e" continuous variable scores are 
expected in floating (F, decimal) format. 

About factor identification . The reading of factor level identification 
must precede the reading of variable scores. In the above example the 
data record is arranged so that this occurs naturally. It is possible 
that the factor level identification is interspersed among the variable 
scores; in this case the "tab" feature of format statements may be used to 
read the factor level identification before the variable scores. If this 
cannot be done the data will need to be rearranged to put level codes first 
in the records 

All records must contain factor level information. When single 
samples are analyzed, it will be necessary either to include a constant 
on the file or to fake a factor with one level. This may be done by 
reading a number off the record and recoding it to 1 or by reading a 
blank on the record and recoding it to 1. 

On IBM machines it will also be necessary to note the following 
comment about reading integer variable scores in F format: for example 
reading the score 12 as F2.0. 

The score 12 read as F2.0 occupies 2 characters. When the program 
copies this score to save it for reanalysis, it copies 12.0 in two 
characters 2.0 which does not include the 1: The copy is overflowed. 
When the copy file is read for reanalysis the program will register an 



-37- 

overflow in subroutine DATVEC. To prevent this disability either 
manufacture the original file as 012 and read it as F3.0 or manufacture 
the original file as 12. and read it as F3.0. 
7. Data 

The data may be on any file as long as the file is designated on 
the' problem card by a two-digit number in columns 17 and 18. Blanks 
in these columns indicate the data are on cards and follow the variable 
format statement. 

If the data are on cards, a blank data record must follow the data: 
there must be as many blank cards as there are cards in the data record 
of one observation. If the data are on a tape or disk file an ordinary 
end-of-file mark will do. 

As a rule it is simpler to arrange factor level identification codes 
first on the data record before the continuous variable scores because 
these codes must be read in first in the record. This expedient is 
not a necessity on many machines because of the "tab " feature of 
FORTRAN IV compilers. 

8. Individual Significance Test Control Card 

The individual significance test cards are used to control procedures 
which can be applied to particular significance tests and not to others. 
These procedures are expository in nature and not usually subject to 
statistical testing. Arbitrarily, the number of statistical tests 
in the model to which these procedures can be applied is limited to six. 

Each individual significance test control card is presented with 
the options which apply to it. The control cards are presented in the 
reverse order from the order of the tests on the significance test 
card (i.e., last to first). 
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Column Name Information 

1-2 NOW The number of the significance test to 

which this information applies. To 
obtain this number from the significance 
test card start at the period and, pro- 
ceeding backwards toward the acronym, count 
the number of equal signs until the test 
which applies is reached, use that number 
in columns 1 and 2. 

3-H CRIT The probability, multiplied by 100, used to 

control (l) additions and deletions of 
variables in battery reduction, (2) 
dimension reduction (use of minus signs) 
in ANOVA and CORREL and* (3) the number of 
canonical variates to rotate in ANOVA 
or CORREL. 

6 — Type of battery reduction procedure for error 

variables 

=1 Efroymson's stepwise addition/elimination 
procedure 

=2 Wherry-Doolittle stepwise addition 
procedure 

=3 Step-up variable deletion procedure 

7-8 MAXFAC Number of factors to be utilized in factor 

analysis tests, rotations and extensions. 
This does not apply to the two principal 
components solutions as they always obtain 
as many components as there are variables. 
It does not; apply to principal factor 
solutions as it always obtains one less factor 
than there are variables. All other procedures 
require an entry. 

10 — Statistical test procedure for factor analysis 

=0 none 

=1 Rao's test, for canonical factor analysis 
only 

=2 Rippe ! s test 

12 — Type of rotation technique for factor analysis 

or multivariate ANOVA or CORREL. (Use of 
this is not necessary if the # (number sign) 
is used after the acronym FACTOR. ) See Table 
III for the usable codes. 



Column Name 



Information 



1 1 | For direct or indirect rotations 

=1 Direct rotation of canonical variates 

or factor analysis rotation 
=2 Indirect rotation of canonical variates 

■jLg For taxonomy of rotation 

=1 for taxonomy of variables 

=2 for taxonomy of groups (in MMOVA only) 

3j_l8 Criterion for uniqueness covergence in canonical 

factor analysis. This criterion is entered as 
an exponent (multiplied by 10) of 0.1 so as 
to give a wide range of values to convergence. 
The entry may be any two-digit number (see 
Table IV ). For example, the entry 31 ^.i 
produces a convergence criterion of 0.1 = 
.00079^329. 

20 „ Communalities supplied for factor analysis 

=1 if squared communalities are supplied 
for factor analysis. Only three procedures 
use this option: principal factor analysis, 
canonical factor analysis and image analysis. 
If communalities are supplied, they must 
immediately follow this card. 

=0 if squared communalities are to be the 
squared multiple correlations between the 
variable and all other variables. IT IS NOT 
NECESSARY to use an individual significance 
test control card to have squared multiple 
correlations used as communalities. All 
common variance factor analysis procedures 
have an automatic default to squared 
multiple correlation. 

22 „ =1 if weights are supplied for multiple groups 

orthogonal centroids or orthogonal bounds 
rotations. If weights are supplied, they 
must immediately follow this card or the 
supplied squared communalities. 

Individual Significance Test Control Card Options 
a. Supplied communalities 

Communalities (squared) must be entered 8 per card, in 10 

column fields, and in the order in which the variables are to be 

analyzed. 
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b. Weights for rotations 

The weights on each variable which determine a transformation 
vector are entered 8 to a card on as many cards as necessary. Each 
weignt is entered in a 10 column field. In each transformation 
vector there must be as many weights as there are variables and 
each transformation vector must begin on a new card. 

The number of transformation vectors to be entered is determined 
by the problem. For ANOVA and CORREL problems, there must be as 
many vectors as there are canonical variates; i.e., the minimum 
of the number of variables (degrees of freedom) for the hypothesis 
or the number of error variables. For factor analysis the number of 
sets of weights must be either the number of variables or the number 
of factors specified in columns 7 and 8 of the individual significance 
test control card. 

10. TITLE Cards (optional) 

TITLE cards may be used to identify reanalyses. 

11. Reanalysis Card (optional) 

This card allows, in the same run, reanalysis of the data. Two major 
changes are allowed in reanalysis: (a) change of any or all contrast card 
sets and (b) change of the model statement. There are some features of the 
first analysis which cannot be changed on reanalysis: (a) the number of 
contrast card sets used cannot be increased (although some factors of the 
design may be ignored in the model statement); (b) the number of levels 
or variables may not be increased (although some levels or variables 
may be ignored on the model statement); and (c) the format cf the data 
(the variable format card) may not be changed. 
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Information 



ANLY, these letters exactly 

If a nev significance test card is used to 
alter the model analyzed, punch a 1 (one) 
in column 6. (Each change of model must 
be accompanied by a reanalysis card) 

The number of changed contrast card sets 
which are to be used in reanalysis. This 
feature can be used to alter several features 
of the model. 

(a) The contrast used for a given 
"factor" of the design may be changed. 
The letter designation of a "factor" 
may not be changed. 

(b) The way in which the dumntf parameters 
and continuous variables are parti- 
tioned into subsets may be changed. 

(c) The names of variables may be changed. 

(d) The transformations of variables may 
be included or changed. 

To change these features simply follow 
the original instructions above. 

The following features are the same as described for the PROB card 

and appear in the same card columns. 



10 The number of individual significance test 

control cards for this reanalysis 

12 Print cell means and variances (l means yes) 

lh Print reduced model matrix (l means yes) 

20 Print only estimates or raw regression 

coefficients (l means yes) 

22 Controls printed output. See Table I 

2U Controls printed output. See Table T 

26 "Vpe of covariance adjustment (0 or 1 does 

the classical adjustment, 2 uses the error 
regression weights for reduction) 



Cols 
1-1* 
6 
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The options of columns 10 to 26 must be reinstated for each reanalysis 
because doing a reanalysis erases the controls of the previous analysis. 

12. Several Problems 

Many sets of data may be analyzed in one run. The cards for each set 
may follow each other. 

13. FINISH Card 

A card with FINISH in columns 1-6 may follow the last problem. 



Programming Notes 
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ERIC 
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Programming Notes 

The VARAN program is written in FORTRAN IV for an IBM 36O-65 computer. 
Insofar as possible it is written to be compatible with any computer which 
uses FORTRAN-like compilers. Such features as may need changing are 
accessible without much labor. The following list shows some of these 
features . 

1. All multiple precision statements are titled "DOUBLE PRECISION." 

2. All input-output units are designated by integer constants which 
can be changed in the main program. 

3. All dimensioning constants for changing the size of arrays are 
centralized in the main program and four executive subroutines. 
This allows for easy changing of size parameters and adaption of 
the program to special problems. The arrays and their dimensioning 
constants are listed in Tables VI through VIII. The program is 
distributed in three sizes, Tiny, Standard, and Large. The FORTRAN 
decks are the standard size while instructions are on the tape 

for implementing the tiny and large sizes. 
The program is put together in two basic sections, one for the clerical 
part of setting up the model, and another for doing the mathematics of the 
solution. The clerical section is controlled by the executive subroutines 
SUBEX1 , SUBEX3 and SUBEX**. The mathematical section is controlled by the 
executive subroutine SUBEX2 with the factor analysis subsection controlled 
by SUBEX5. Table V lists each of the subroutines with a brief description 
of its purpose. This table lists each major executive subroutine and after 
it the miscellaneous subroutines used by it as well as the major subroutines 
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yhich are called by the executive subroutine. The order of the listing 
is roughly dictated by the following considerations: 

1. the order in which the program executes 

2. the order in which the overlay structure is put together, and 

3. the order in which the FORTRAN decks are listed on the mailer 
tape (the exception here is that the executive subroutines are 
at the head of the mailer tape) 

Comment cards are used quite liberally to explain the functions of 
subroutines. At the head of each subroutine there are several comment cards 
which describe the function of the subroutine and how it operates. 

Accuracy of calculations . The programming sequence was arranged in such 
a way as to be convenient and accurate for multivariate problems. Therefore 
some of the univariate side statistics have been manipulated excessively 
outside the main flow of calculations and do not serve as checks for 
numerical accuracy. Accuracy checks should be made on the multivariate 
calculations . 



Subroutine Name 
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Table V 



Function 



MAIN 



Main program 



Called frcm 



MATA 



Add matrix A to matrix B and put in C 



MANY 



MATX 

Multiply matrix A times matrix B and put in C 
ERROR 

Writes out error messages 
IMPINV 

Subroutine to obtain an improved inverse 



MANY 



MAIIY 



MAilY 



IMVT 



Inversion of asymmetric matrix-Gauss pivotal 



IMPIKV 



ERIC 



IFRMA 

Function to convert alphabetic characters to integers 

MOVCHR 

Moves a character 

LOOKUP 

Search table for value equal to X 
BINBCD 

Converts binary to EBCDIC (IBM 360) 
SUBEX1 

Executive subroutine to set up data vectors 

*»• 

RDCONT 

Read contrast cards and contrast card options and set up one way 
design parameter matrices 

DEVDES 

Obtains deviation contrast design parameters 
SPCON 

Reads in special contrasts (transposed) and computes design 
variables for special contrasts 

ORTPOL 

Computes orthogonal polynomial design variables 
HELDES 

Obtains reverse Helmert design parameters 



MANY 



MANY 



MAHY 



RDSIGT 



MAIN 



SU3EX1 



RDOOST 



RDCONT 



SDCONT 
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Sub routine Name 



Table V (Cont'd) 
Function 



SPDES 

Reads in special one-way design matrices 
RDSIGT 

Read the significance test card for the first time and set up the 
parameter codes for the terms listed. The scanning is done 
from left to right stopping at the period 

DATVEC 

Set up the data vector for each data case 

1*EST 

Generates an identity matrix for nested effects 
REMOD 

Print reduced model matrix • Generate data vectors for cells which 
are present 



Called from 



RDCONT 



SUBEX1 



WSTA1 



SCP 



Finds the cell means and STD DEVS 

X3 

Executive subroutine to obtain cross products matrix and sweep it 
Accumulate and store cross-product matrices of the data vectors 



SUBEX1 
DATVEC 
SUBEX1 

SUBEX1 
MAIN 
SUBEX3 
SUBEX3 
MAIN 



SWEEP 

Perform Gaussian sweep by matrix blocks 

SIBEX14 

Scan the significance test card and set up for each individual test. 
First scan up to the colon to get permanent covariates , extensions 
and rotation types which hold for all the tests. Then skip to the 
period and scan backwards, one test at a time, between commas and 
minus signs . 

HYERCO SUBEXU 
Get up matrices for a hypothesis-error only, hypothesis-cov, hypothesis- 
extension, or hypothesis-vector product analysis 

HOHYCO SUBEXU 
Set up matrices for no hypothesis-error, no hypothesis-covariate, no 
hypothesis-extension, or no hypothesis vector product analysis 



1IITER 

Set up matrices for interaction tables 



SUBEXU 



Subroutine Name 



APLB 



Table V (Cont'd) 
Function 



Form matrices for the analysis subroutines 
RECON 

Reconstructirn of matrices to conform to each other in terms of 
what has been swept 

GETM 

Get the I, J matrix from the SWP tape 
SUBEX2 

Subroutine for controlling analytic sequence 



ATOBT 



Subroutine to move matrix A to B and transpose it 



MAB 



Multiply matrix A times matrix B and put in C 



MABT 



Multiply matrix A times matrix B-transposed and put in C 



Called from 



MANY 



MATB 

Multiply matrix A-transpose times matrix B and put in C 
OUTAB 

Subroutine to print out tables 

PROB 

Subroutine to get probabilities in several variance distributions 
including F, t, normal deviates and chi-square 

UNDFLW 

Subroutine to centralize IBM ERRSET controls 
WILKS 

Calculate the overall F-test for Wilks lambda criterion, for 

multiple roots calculate chi-square for Bartlett's dimensionality 
reduction 

HOW 

Subroutine to obtain eigenvectors and values of a matrix 
TRIDI 

Tri-diagonalization subroutine DWM 1517-UB 



APLB 



APLB 



MAIN 



MANY 



MANY 



many 



MANY 



MANY 



MANY 



MANY 



MANY 



MAN* 



HOW 



ERLC 



1 
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Subroutine Name 



Table V (Cont'd) 
Function 



EIGVEC 

Set up simultaneous equations for eigenvector wich eigenvalue E 
EIGVAL 

Eigenv^ue subroutine for tri-diagonal matrices DWM 1517-UB 

MEANS 

Subroutine to provide estimates only or interaction tables 

DIIIRED 

Subroutine to perform dimension reduction between separate hypotheses 
of ANOVA 

ADJHYP 

Subroutine to adjust hypothesis variables sum of squares and 
hypothesis-errors cross products 

ADJERR 

Subroutine to adjust error sum of squares and raw estimates for 
covariates 

HEDHYP 

Subroutine to reduce hypothesitf-errors cross products by error 
regression weights 

ESTADJ 

Subroutine to obtain raw estimates adjusted for covariates from 
raw cross products 

ANAL 

Performs calculations for AHOVA and CORREL 

DECOMP fi 

Subroutine to decompose a symmetric matrix A into TT ! where T is 
triangular 

PRANOV 

Subroutine to print AHOVA calculations from ANAL 
PRCOR 

Subroutine to print correlation calculations from ANAL 
SUBEX5 

Subroutine to sort out factor analysis procedures 
TRAHV 

Subroutine to follow immediately after DETER 



Called from 



HOW 



HOW 



SUBEX2 



SUBEX2 



SUBEX2 



SUBEX2 



SUBEX2 



SUBEX2 



SUBEX2 



ANAL 



SUBEX2 



SUBEX2 



SUBEX2 



MANY 



9 

ERLC 



1 
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Subroutine Name 
DETER 

Obtain determinant of A 



Table V (Cont'd) 
Function 



SIMEQ 



Solve (A + PHI*I)X = B • A is symmetric . 
RESOL 

Solve. (A + PHI*I)V = X subject to V(TR)*V = CON (usually = l.J 
Restricted least squares. 

FIRFAC 

Subroutine to generate first five solutions of factor analysis 
ALPHA 

Subroutine for alpha factor analysis: Michael Browne, circa 1968 
MALFA1 

Maximum likelihood factor analysis. Version 2. Variables giving 
Heywood cases are partialled out. 

MINRES 

Least squares factor analysis. Fixed unit weights. 
PRIFAC 

Subroutine to print factor analysis results 



RAO 



Subroutine to compute Rao's test of significance for canonical 
factor analysis 

RIPPE 

Subroutine to compute Rippe's test lor completeness of factorization 
RuTCON 

Subroutine for controlling rotations 
VARMAX 

Subroutine to perform quartimax, /arimax, and equamax rotations 
PROMAX 

Performs promax rotations 
WATCEN 

Subroutine for doing three rotations, multiple groups, orthogonal 
centroids and orthogonal bounds 



Called from 



MANY 



MANY 
MANY 

SUBEX5 
SUBEX5 
SUBEX5 

SUBEX5 
SUBEX5 
PRIFAC 

PRIFAC 
SUBEX2 
SUBEX2 
SUBEX2 
SUBEX2 



ERLC 
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Subroutine Name 



GRAM 



Table V (Cont'd) 
Function 



Factor to lower triangle and orthogonal matrix 
PRIROT 

Subroutine for printing results of rotations 
EXTEND 

Subroutine to perform and test extensions 
BATRED 

Subroutine to do battery reduction techniques on error variables 

ELIM 

Subroutine to partial out variables from a canonical correlation 
problem ' 

VARED 

Subroutine to calculate loss of variance due to orthogonalization 
of design or of covariance analysis 

INDSIG 

Subroutine to read individual significance test cards and options 



Called from 



WATCEN 



SUBEX2 



SUBEX2 



SUBEX2 



BATRED 



SUBEX2 



SUBEXii 
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Table VI 



Arrays with Variable Subscripts 



Array 



varnam(mnl,mf) 
kstart(mnt) 
tname(mnc,mnt) 
qname(mncq) 
x(mnsigc*8o) 
hypnam(mnl) 
errnam(mnl) 
othnam(mnl) 

levsum( mhl ,mnf ) =levsub 
ilhe(mmhe) 



(double precision) 



(double precision) 
(double precision) 
(double precision) 



dum(mnl,mnl) 
cont(mnl,mnl,mnf) 

iWABLE(MNF) 

ICONT(MHF) 

NLEV(MNF) 

CMPVAR(MNF) 

RDNAM(MNF) 

TRAN(MNL,MNL) 

NPART(MNF) 

LEVCUM(MNL,MNF) 

RECODE(MNL,MNF) 

mame(mna,mnf) 

is(mhf) 

ie(mbf) 

IC(MNF) 
IN(MNF) 

NAFCD(MNF,MNT) 

ICELL(MNF) 

VAR(MNL) 

DVEC(MNCD) 

IFL(MNF) 

KSTART(MNT) 

FMT(NFMT) 

VECT(MNL) 

SUM(3,MNSV,MNCELL) 
IDENT( MNCELL , MNF-2 ) 
IRM(MNCEJ.L) 

A(NORD,NORD) 
B(NORD,NORD) 
C(NORD,NORD) 
D(NORD,NORD) 
E(NORD,NORD) 



Subroutines with Fixed Dimensions 

MAIN,SUBEX1,SUBEX*4 

MAIN , SUBEX1 , SUBEX3 , SUBEXl* 

MAIN,SUBEXl* 

MAIN,SUBEXi* 

MAIN,SUBEXU 

MAIN,SUBEX2,SUBEXl* 

MAIN,SUBEX2,SUBEXU 

MAIN,SUBEX2,SUSEXl* ^ 

MAIN,SUBEX1,SUBEXU 

MAIN,SUBEXU 

SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 
SUBEX1 

SUBEX2 
SUBEX2 
SUBEX2 
SUBEX2 
SUBEX? 
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Table VI (Cont'd) 



Arrays with Variable Subscripts 
Array Subroutines with Fixed Dimensions 

Vl(NORD) SUBEX2 

V2(N0RD) SUBEX2 

V3(N0RD) SUBEX2 

VU(NORD) SUBEX2 

V5(N0RD) SUBEX2 . 

AHYP(MNCQ) also AHYP(NORD) SUBEX2 

DVEC(MNT) SUBEX3 

E(MAXD) SUBEX3 

IBEG(MNT) SUBEX3 

Nl(MNT) SUBEX3 

N2(MNT) SUBEX3 

Kl(MNT) SUBEX3 

K2(MWT) SUBEX3 

A(MNL,MTIii) SUBEX3 

B(MNL,MNL) SUBEX3 

C(MNL,MNL) SUBEX3 

D(MNL,MNL) SUBEX3 

HROW(MNT) SUBEX3 

V(MNL) SUBEX3 

LHE(MWHE,5) SUBEXli 

A(MNL,MNL) SUBEXli 

B(MNL,MNL) SUBEXli 

C(MWL,2*MNL) SUBEXli 

X(MNSIGC*80) SUBEXli 

PNAME(MNC) SUBEXli 

YD(MNL) SUBEXli 

IROWC(MNHE) SUBEXli 

D(MNL,MNL) SUBEXli 

AINTG(MNL) SUBEXli 

BDY(MNL-30) SUBEXli 

TOTNAM(MNL) (double precision) SUBEXli 
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Table VII 

Dimensioning Constants 

MNL=NORD= Maximum number of levels for a factor and one more than the 
maximum number of continuous variables 
Originates in MAIN, SUBEX1, SUBEX2, SUBEX3, and SUBEXH 
Standard version=l*0 

MNF= Maximum number of factors (including continuous variables) plus 1 
Originates in MAIN and SUBEX1 
Standard versJon=8 

MNCD=MNT=NM= Maximum number of entries in the data vector; maximum number 
of terms on the significance test card; or maximum number of 
matrices in core at one time (in SUBEX3) 
Originates in MAIN, SUBEX1 , and SUBEX3 
Standard version=258=2**MNF+2 

MNC= Maximum number of words in each term name on the significance test card 
Originates in MAIN, SUBEX1 and SUBEXH 
Standard version=3=12 characters 

MNCELL= Maximum number of cells for calculation of means and sigmas 
within cells 
Originates in SUBEX1 
Standard version=5l6=2**(MNF+l )+h 

MAXD= Maximum dimension of a vector in SCP 
Originates in SUBEX3 

Standard version=20 ,000 , greater than U*MNL**2 
(It is important that this be as large as possible without 
dictating the size of storage. It limits the size of the 
g sum-of-cross-products matrix. When the SCP matrix is not 

large enough to hold the entire set of parameters, two or 
mere passes are made in a loop which is the slowest part of 
the program. Making two or more passes through this loop 
greatly increases the cost of running the program.) 

MNCQ= Maximum number of words in QNAME (a significance test) 
Originates in SUBEX** 
Standard version=h0=l60 characters 

(Through an error, the printout will only list NORD words 
of the test statement.) 

MNSIGC= Maximum number of significance test cards 
Originates in SUBEXU 
Standard version=9 



Table VII (Cont'd) 

Dimensioning Constants 

NFMT= Maximum number of words in the variable format statement 
Originates in SUBEX1 
Standard version=200=10 cards 

MNHE= Maximum number of items in each of ^>he LHE lists 
Originates in MAIN and SUBEXH 
Standard version=20 

MNA= Maximum number of characters in a factor name (the alphabetic 
description on the contrast card) 
Originates in SUBEX1 
Standard version=20 

MNSV= Maximum number of variables to be printed across a page for 
cell means and standard deviations 
Originates in SUBEX1 
Standard version=8 
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Table VIII 
Alterable Constants 



Unnamed 


^ujmauu in ctiiu. o odiividLi u. version q.6 s lgnai^ ions 


KIN 


System input unit = 5 


KOUT 


System printed output unit = 6 


KPNCH = 


System punched output unit = 7 


I SAVE = 


Temporary storage unit = 19 


ITEMP = 


Temporary storage unit = 18 


ISTOR = 


Temporary storage unit = 17 


I KEEP = 


Temporary storage unit = l6 


IBAG = 


Temporary storage unit = 15 


ISIGT = 


Temporary storage unit = lU 


IOTFIL = 


Temporary storage unit = 13 


ISCPT = 


Temporary storage unit = 12 


ISWPT = 


Temporary storage unit = 11 


INFILE = 


Data input file if not KIN 


UNDER = 


Smallest number machine will handle = 1.0E-70 


OVER = 


Largest number machine will handle = 1.0E70 


CNEARO = 


Constant for controlling rounding error = # 000001 


INFO = 


Array for controlling analytic sequence 
IMP0(i5) to INF0(50) unused on June 1, 1972 
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Table IX 
Constants in INFO 



INFO(l) = 1 If analysis of variance 
= 2 If correlation 
= 3 If factor analysis 
= 5 If interaction tables 
6 If variance reduction 

1 If error sum of squares is residual 

2 If error sum of squares is generated from estimates and 
design parameters 

ANOVA: Degrees of freedom for hypothesis 
CORREL: Number of variables in hypothesis set 
= Order of error sum of squares matrix 
= ANOVA: Number of variables 

CORREL: Number of variables in error set 
= Degrees of freedom for error 
Controls battery reduction 
= 1 Efroymson's stepwise procedure 
= 3 Step-up procedure 
= 2 Wherry-Doolittle procedure 

= Probability level for adding and dropping variables in 
battery reduction 
and INFO(lO) Control standard analysis print-cut 
= 0 Print minimal sets 
= 1 Print intermediate sets 
= 2 Print everything 
= 3 Print INFO ( 10) output only 
) For ANOVA 
= 1 Print orthogonalized raw estimates 
= 2 Print disc. fn. coef. for standard scores 
= 3 Print disc- fn. coef. for raw scores 
= h Print transf. matrix for canonical contrasts . 
) For CORREL 

= 1 Print canonical raw score regression weights 

= 2 Print canonical raw score regression weights to obtain unit 

sum of squares regressed scores 
= 3 St. score and raw score weights for regressing hypothesis 
variables onto error variables 
INF0(10) For factor analysis 

= 0 Print minimal set 
= 1 Print everything 
INFO (10) For VARED 

= 0 Print dat i for hypothesis variables only 
= 1 Print data about design parameters, contrasts and 
correlations among hypothesis sums of squares 
INFO (11) through INFO(lU) Control rotation procedures 



INF0(2) = 



INF0(3) = 

INFO (10 = 
INF0(5) = 



INF0(6) 
INF0(7) 
IHF0(7) 



INF0(8) 

INF0(9) 
INF0(9) 



INFO (10 



INFO (10 



Table IX (Cont'd) 



Constants in INFO 



INFO(ll) = Number of factors in factor analysis 

= Criterion for determining number of significant canonical 
variates in MANOVA or canonical correlation rotation problems 
INF0(12) = 1 If Quartimax 
= 2 If Varimax 
= 3 If Equamax 
= h If Promax 
= 5 If multiple groups 
= 6 If orthogonal centroids 
7 If orthogonal bounds 

1 If factor analysis or direct rotation of canonical variates 

2 If indirect rotation of canonical variates 

1 If taxonomy of variables 

2 If taxonomy of groups 
Number of covariates 

1 Classical covariance adjustments 

2 Covariance reduction by error regression weights 
Number of extension variables 
L If a test with dimension reduction 

2 If a test with dimension reduction and for same 

3 If a test without dimension reduction but for same 
Number of dimensions which have been set up for reduction 
Number of observations minus number of parameters swept out 
1 If raw estimates only for ANOVA 
1 Principal components of dispersion 

= 2 Principal components of correlation 

= 3 Principal factor analysis 

= k Image analysis 

= 5 Canonical factor analysis 

= 6 Maximum likelihood 

= T Alpha 

= 8 Minres 
INF0(23) = 0 No communalities required 

= 1 If communalities start with S.M.C, 

= 2 If communalities supplied 
INF0(2h) = Number of factors to be removed and tested = INFO ( 11) 
INF0(25) = Significance tests procedure 

= 0 If none 

= 1 If Rao's 

= 2 If Rippe's 
INF0(26) = Blank 

INFO (27) = Power of .1 to be used as a convergence criterion (multiplied by 10) 
INF0(28) = Number of individual significance test cards 

INF0(29) = Number of calls to INDSIG (i.e., number of the current hypothesis test) 
INFO(30) = 1 If rotations are multiple groups, orthogonal centroios or 
orthogonal bounds 

INFO (31) Counts the number of individual significance test cards read in INDSIG 

INFO (32) = Current INP0(3) 

INF0(33) = INP0(3) for previous test 

INF0(3^) = 1 For calls to Wilks f rom DIMRED 



INF0(13) 
INFO(lU) 

INFO (15) 
INF0(l6) 

INF0(17) 
INF0(l8) 



INFO (19) 
INFO (20) 
INF0(21) 
INFO (22) 



Mathematical Notes 
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Mathematical Notes 

These notes are to describe the mathematical calculations involved 
in solving the linear model. As the steps are described, the subroutines 
which execute these steps will be indicated in capital letters within 
parentheses . 

The program is written as a full linear model. That is, as the vector 
of measurements for each observation are read into the computer, dummy 
parameters (or design parameters or fixed variables) are added to the 
vector according to the requirements of the linear model (DATVEC). The 
new data vector including dummy parameters and measurement variables is 
stored on tape or disk. 

When all observations have been read in and data vectors have been 
constructed, a sum-of-squares and cross-products matrix is then constructed 
(SCP). In order to accommodate analyses which have a large number of 
parameters, the sum-of-squares and cross-products matrix is not necessarily 
stored entirely in the machine. The program generates and stores as large 
a portion of this matrix as possible, then rewinds itself and generates 
another large portion of the matrix. Unfortunately when it is necessary 
to generate the sum-of-squares and cross-products matrices in portions, the 
expense of a run increases greatly. (Fortunately this is a very rare 
occurrence.) The sum-of-squares matrix is partitioned into submatrices 
which are completely determined by the effects listed on the model card. 
This matrix is stored on disk as partitioned blocks and elements within 
blocks rather than elements within a matrix. 

The complete sum-of-squares matrix is then reduced to the sums of 
squares and cross products for the various effects as required by the 
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model (SWEEP). This is done by sweeping out the effects listed in the 
model in the order in which they appear (from left to right) on the 
significance list card. A statement of the order appears on the print- 
out. The sweeping process is done as a straightforward Gaussian reduction, 
but instead of reducing the matrix to a diagonal an element at a time, the 
process reduces the matrix a partitioned block at a time. That is, the 
Gaussian sweep is generalized from an element at a time to a submatrix at 
a time. Otherwise, the process used is that described by Bock (1963). 

Subsequent to the sweeping process, the appropriate sums-of-squares 
and cross-products matrices for each linear analysis are assembled (SUBEXM 
and passed on to the appropriate analytic section. 

ANQVA and CORREL Solutions 

Analysis of variance and correlation problems stem :'rom the determinantal 
equation 

|a-xb|=o (i) 

where A is a matrix of the sum of squares and cross products for hypothesis 

and B is, for AiJOVA, the sum of squares and cross products for errors or, 

fcr CORRLL, the sum of squares and cross products for "totals" (using the 

convention that the sum of squares for errors plus the sum of squares for 

hypothesis ir the sum of squares for totals regardless of what other 

parameters are in the model for the data). 

The solution of this equation (ANAL) is accomplished as follows. 

1. The matrix 3 is scaled to correlations and A scaled 
compatibly. 
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2. The scaled B is decomposed (DECOMP) into a product of 
triangular matrices. This triangular matrix is then 
inverted and pre- and postmultiplied onto the scaled A. 
This results in reducing the original determinantal 
equation into a single matrix eigenvalue problem. 

3, The eigenvalues are calculated and eigenvectors determined (HOW). 
This description is the essential flow of the calculations. However, 

there are many side statistics calculated to provide expository material for 

the analyses. Table X lists the sid.e statistics and gives their formulations. 

In Table X the symbols have the following meanings. 

Z-the hypothesis variables: in ANOVA the dummy parameters, dummy 
variables design parameters, fixed variables or independent 
variables; in correlation the independent variables or predictors. 
They are assumed to have mean zero. 

X-the error variables: in ANOVA the observed variables, the 
dependent variables, the continuous variables: in correlation 
the criteria or dependent variables. They are assumed to have 
mean zero. 

When the determinantal equation (l) is written for correlation problems 
using Z and X, it can be written in matrix notation as 



The values for X are the squared canonical correlations. When the equation 
is written for analysis of variance, the matrix B is 



which for convenience of writing will still be notated X'X even though 



|(X f Z) (Z f X) 



-1 



(Z'X) - X(X f X)| =- 0. 



(2) 



B = (X f X) - (X'Z) (Z'ZT 1 (Z'X) 



it is a residual error or within-cells matrix. 



The values of X here 



become ratios of sums of squares. 



Z'Z 



A of equation 1 




is the diagonal matrix of the square roots of the diagonal of 
Z f Z. In correlation problems this is divided by a constant 
to give the standard deviations of the Z variables. 
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a x is the diagonal matrix of the square roots of the diagonal 
of X'X. In correlations problems this is divided by a 
constant to give the standard deviations of the X. In 
analysis of variance this is divided by a constant to give 
the standard errors of the X. (The matrix X'X is not 
the same in both solutions.) 

-1 -1 

Q is the triangular decomposition of o' x ^'Xa^ 

T is the matrix of eigenvectors of 

iQ'-^U'ZHZ'zr 1 (Z»X) a x ~V X - XI | = 0 

DFH degrees of freedom for hypothesis 
DFE degrees of freedom for error 
The final matrix equation of the solution can be written 

X" 1/2 TQ^ 1 a x " 1 (X»Z)(Z»Z)" 1 (Z»X)a x V 1 T , X^ /2 = I. 

The various parts of this formulation are entered in Table X and cross- 
referenced to Table I to help the reader recognize the printed output if he 
is not familiar with the style 'of nomenclature used here. 

Factor Analysis Solutions 

The factor analytic solutions available in VARAN are common ones and 
the nomenclature used is standard. The subroutines for doing the maximum 
likelihood, ALPHA and MINRES procedures, were written by Michael Browne 
and we^e merely adapted for use here. These routines print out information 
which is not generally used but was retained nevertheless. 

Interaction Tables 

The mathematics of calculation of interaction tables is simple enough. 
The parameters for nesting within the cells of the desired interaction are 
generated, and the estimates for th?s design are calculated, viz: WAB 
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produces the estimates for the cells of the AB face of a factorial design. 
The grand mean must not be included in INTABL models. 

The acronym INTABL concludes the ANOVA calculations when estimates 
have been obtained, then prints them out* 

Variance Reduction Analysis 

Variance reduction analysis is a simple procedure for evaluating the 
effect of imbalance in an ANOVA design due to unequal cell sizes. Basically, 
the hypothesis sums of squares for the given test is generated both before 
and after taking account of the inequality of cell sizes. (The grand mean 
is not removed from the data.) The "before" sums of squares is assumed 
to be ICO per cent of the data available. When the "after" sums of squares 
is larger than the "before" a negative loss is indicated. 

Dimension Reduction Analysis 

This feature is available through the significance test card and is 
indicated by a minus sign (-). In use it pertains to a feature of multi- 
variate analysis of variance and canonical correlation which has no uni- 
variate analog. 

Tc explain its use, suppose we are given two independent statistical 
hypotheses, both involving the same error variables, such as an interaction 
and a main effects test in a MANOVA design oi p variables. Suppose the 
interaction test, AB=V, has r < p significant dimensions (or roots). 
These r dimensions of the p dimensions of error have the same 
properties as r variables in r univariate analyses and suggest that 
the main effects of these r variables need not be examined further. How- 
ever, the remaining p-r dimensions of error are not interactive and can 
be reasonably tested over the main effect A. 



The test of the main effect of A on the p-r dimensions of error 
is reasonable and can be effected in at least two ways: (l) analysis of 
covariance, using the significant interaction discriminant variables as 
covariates, and (2) obtaining p-r linear combinations of the error 
variables which are independent of the significant interaction discriminant 
variables and analyzing these # 

The method of dimension reduction removes the significant interaction 
discriminant variables from the sums of squares for the test A=V. This 
forces those same linear combinations of the error variables to have zero 
roots in the determinantal equation Ja-AB| = 0, thus removing them from 
the analysis. The main effects test A=V is then calculated, and the 
multivariate probability tests are compiled as if there were only p-r 
variables in the test. The univariate F ratios are not altered* 

The notation on the significance test card to produce this test of A 

is 

AN0VA:W=0 ,A=V-AB=V. 
with an individual significance test card used to indicate the probability 
level in the test of AB=V which determines significance and the number of 
canonical variates to be removed from A=V. 

* 

In addition to this application to analysis of variance, dimension 
reduction can also be applied to canonical correlation, 

CORREL : W=0 , V1=V2-V3=V2 . 
and as a type of analysis of covariance, 

ANOVA : W=0 , A=V1-V2=V1 . 



Sample Input and Output 
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SAMPLE INPUT 

//tfcH***02 JUB ( 6600, bb, 10, 9, 999) , '04161 , TKN , h A LL CHA S • ,CLASS=b 
//STtPl bXbC PGi*i=VAkAN,kbGIUi\J=( ,l6bK) ,T lMb=b 
// STEPL lb i)D US)M=HALLLIb v UIMl 1 =2314 , VUL=Sbk=RbbL lb ,Ul SP = SHR 
UL>NAtob = S YSIN 

SYSUUT=A,UCB = ( kbCHM=hbA , LKbCL = 1 3 3 , b LKS I Zb=3458 ) 
SPACb=(CYL, ( * ' " " " 



SPaC b= ( C YL , ( 
SPACb=(CYL, ( 
SPACb= ( C YL , ( 
SPACb= (CYL, ( 
SPACe=(CYL, ( 
bPACb= (CYL, ( 
SHACt= (CYL , ( 
SPACb= (CYL , ( 



//HTObFOOl [)D 
//HT06H001 1)1) 
//HTllf-uol liU 
//H'12h001 
//H13F001 IJD 
//HT14H001 uu 
//HTlbf-Ool OU 
//hT16h()0l i)D 
//HT17H001 UU 
//HTlHhOOl I M i 
//FT19F001 UU 
//bYSlN * 
lITLb PRUbLbM o, TwU WAY hACTURlAL 
TITLb MAX I ANUVA PkliMlUUl 

TIlLt CbLL nhAiv'b PklhObU 
rilLt PKImT KbDUCbU i^'UUbL MATklX 
TITLb TbST PkUbLbft bklm HALL Ai^D CkAf*»fcR 
PKllb 13 11 2 
2 , , , , hACTUk A. 

2 , , , * hAC'l UK b. 

6 1 , , , , Cll^TI^ULJlib 
bKkUK l bkKUk 2 bkkuk 3 
AiMUVA :W = 0, A = V,b = V ,Ab = V. 
(2Ili lOXi 6H6.0) 



)*,uISP = imHw,UivI T = UJ bK 
) ,UlSP = IMbW,UIMri=UlbK 
) ,UlSP = NbW,UlvI 1=1)1 bK 
) ,UlSP = N(-W,ONlT=UlSK 
) ,UlSP = lMb W,UNI T=Ul bK 
) ,Ul SP = iMbW ,UK'IT = U ISk 
) ,UlSP = iMbW,ONll = Ul i>K 
) ,Ul SP=iMbW tUlMll =Uli>K 
) , UlbP = Nh W,Ul\*n=Ul bK 
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Til Lb SPECIAL CUNTRaSTS, HUMUGtNtlTY Ob RbORc bS IUN, CANONICAL CURRbLAT I UN , 

TITLE ROTATION UF CANONICAL VARlATbS, NAMfcU CONTRASTS AlVU MAXImUN 

TITLfc CUKRfc LAT I U'" PKIimTuUT 

TITLE TbST PROttLbM bROC" HALL AND CKAPifcR 

PKUtf 12 1 2 

A 1 4 1 ,,11122122,, SPECIAL CUInTKASTS. 

1ST-4TH 2N0-3RU LbbTOVbR 

1.0 1.0 1.0 1.0 

l.o -l«o 

1.0 -l.o 
1.0 -l.o -1.0 l.o 

V 6 13 3,,,,VARlAbLbS. 

bRROR 1 bRROR 2 bRRUR 3 bRRUR 4 bRRUR b ERROR 6 



CORREL :W = 0, 


A=0 ,V2 


=V1 ,AV2 


=V1 . 








(12, 


10X,6F6 
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CORPELaTIUnS bETWEEiM THE ERRUk VARIABLES AWO THE 
ROTATED CAKjMCAL VAR I AT tS 

CANUnICaL VAklATE 
VARIABLE 1 2 3 



ERRUR 1 
ERROk 2 
ERROK 6 



0.9916 
-0.061b 
-0.1250 



-0.0594 
0.96*0 
0.24b4 



-0. 1149 
0.2413 
0.9011 



TkawSKURwaT IOi : .ATR I X 

'i>A:^. VECTUK 
CAh' • VaK« 1 



0.6cH)d -0.4192 
-u.2vW 0.5bSV 
0.6 V 07 



3 

0.6007 
0. 7r>0^> 
-o.27i>6 



SUi'-Ul — S<HMKfcS maTRIX OH RUTaTEi; hYPUfhESIS CAMOiMI CAL VAklATES 

CanuMCaL VaRIaTH 

1 2 3 

1 0.1151 -0.0747 0.0669 

2 -0.0747 u.07ot -0.0ii>7 

3 (n 0669 -0.0157 0.1333 

-clbHTS FOR KfcGKhSSIwG EKKUK VaRIa^LES • Ii - 1 U kUTATftP Ek'<M»R CA» O ICAL VARI;\"I 

CaiHjnICaL VaRIa'U: 

VARIABLE 1 2 3 

ERRUk 1 0.0065 0.0002 0.0008 

CKKUR 2 0.0017 0.0r>5^ -0.0139 

ERRUR 3 0.0047 -0.0113 0.0467 



TEST WITH fiO ERROR i^JT DUNE 
a=0 



TEST WITH NO ERROR k'UT DONE 
*"=0 
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VaRAjm: LI HEAR MODEL VARIANCE ANALYSIS 
FIRST EDITION: JUNE 1, 1972 
EDUCATIONAL TESTING SERVICE 
PRINCETOc, w.J. 08540 
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